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Abstract ~ The purpose of this work is to extend results of previous papers [13|, 
Q to quantum channels with additive constraints onto the input signal, by showing 
that the capacity of such channel is equal to the supremum of the entropy bound 
with respect to all apriori distributions satisfying the constraint. We also make 
an extension to channels with continuous alphabet. As an application, we prove 
the formula for the capacity of the quantum Gaussian channel with constrained 
energy of the signal, establishing the asymptotic equivalence of this channel to 
the quasiclassical photon channel, and derive the lower bounds for the reliability 
function of the pure-state Gaussian channel. 



I. The case of discrete alphabet 

Most of the results concerning the capacity of quantum communication 
channels were proved for the case of finite input alphabets 0, [|TT|, [|I^]. 
The importance of considering channels with infinite (continuous) alphabets 
and with constrained inputs was clear from the beginning of quantum com- 
munications 0, Ijl^, and was reiterated in |T^. The present paper, which is 



a continuation of our papers [0, 0, is devoted to study of this case. 

For reader's convenience we start with repeating some basic notions from 
Jl^, making necessary modifications for channels with constrained inputs. 
Let ?i be a Hilbert space. A quantum communication channel with (possibly 
infinite) discrete input alphabet A = {i} consists of the mapping i ^ Si 
from the input alphabet to the set of density operators (d. o.) in ?-^. The 
input is described by an apriori probability distribution vr = {tTj} on A. 
At the output there is a quantum measurement in the sense of 0, given 
by resolution of identity in Ti, that is by a family X = {Xj} of positive 



^Visiting Research Scholar at the Department of Electrical and Computer Engineering, 
Northwestern University. 



1 



operators in Ti satisfying Y^j Xj = /, where / is the unit operator in H. 
The index j runs through some discrete output alphabet, which is not fixed 
here. The conditional probability of the output j if the input was i equals to 
-P(jK) — TrS'jXj. The Shannon information is given by the classical formula 

/.(,.A-)^i:i:,.P(,i.)iog(^^J|^). (1) 

Let f{i) be a function defined on the input alphabet. We shall restrict 
the apriori distributions vr by the inequality 

E/(^K(^)<^, (2) 

i 

where £" is a real number, and denote the class of such probability distribu- 
tions by Vi. 

Let us consider also the product channel in the Hilbert space Ti*^" = 
7i® ... ®7i with the input alphabet consisting of words u = (zi, i^) of 
length n, and the d. o. 

Su = Si^ (g) ... (g) Si^ 

corresponding to the word u. If vr is a probability distribution on A"' and 
X is a resolution of identity in we define the information quantity 

J„(7r,X) by the formula similar to (|T]). We put the additive constraint onto 
the distribution vr by asking 

E [/(h) + • • • + fi^nMn, ...,in)< nE, (3) 

ii,...,in 

and denote by Vn the class of probability distributions satisfying this con- 
straint. 
Defining 

Cn= sup /„(7r,X), (4) 

we have the property of superadditivity Cn + Cm < Cn+m, hence the following 
limit exists 

C = lim Cn/n, (5) 

which we call the capacity of the initial channel. (This quantity may be 
eventually infinite for infinite dimensional Hilbert space and in what follows 
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we consider the nontrivial case C < oo.). The definition is justified by the 
classical Shannon's coding theorem for channels with constrained inputs. 
Namely, call by code of size N a sequence {ui,Xi), {uN,X]\f), where Uk 
are words of length n, and {X^} is a family of positive operators in ?-^®", 
satisfying Y^jLi^j ^ I- Defining Xq = I — Y^jXj, we have a resolution of 
identity in Ti^^. An output k > 1 means decision that the word Uk was 
transmitted, while the output is interpreted as evasion of any decision. The 
average error probability for such a code is 

1 ^ 

^ = T7E[1-Tr^".^.]. (6) 

k=i 

Let us denote p{n, N) the minimum of this error probability with respect to 
all codes of the size N with words of length n, satisfying the condition 

f(i,) + ... + f{i^)<nE. (7) 

As a direct consequence of theorems in Sec. 7.3 of 0|, one can prove the 
following statement providing information-theoretic justification of the defi- 
nition (^: 

Proposition 1. If R < C , then p(n, e"^) , and if R > C , tlien 
j9(n,e"^) 74 0. 

Let H{S) = — TrS'logS' be the von Neumann entropy of a d. o. and let 
71 = {tTj} be an apriori distribution on A. We denote 

and assume that 

sup H{S^) < 00. (8) 

Further, we denote 

AH{n) = HiS^)-Y,7r,H{S,). (9) 

Proposition 2. Under tie condition @ 

C = sup AH {71). (10) 
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Proof. To prove the < part, take vr G P„. The entropy bound |0, |T6 
imphes 

/„,(7r,X) < Ai7„(7r), 

where Aif„(7r) is the analog of AH^tt) for the product channeL According 
to the subadditivity property of quantum entropy 



k=l 



where vr'-'^^ is the fc-th marginal distribution of tt on A. Therefore 
1 1 " 

-4(7r,X) <-Y.AHin^'^) < Ai7(vr), 
n n^^ 

where vf = ^ Z]fc=i tt'-'^'*, since AH^tt) is concave function of tt 0. Also 
inequality (0) can be rewritten as 

n 

Y.j:fi'kW\'k)<nE. 

k=l ik 

It follows that n^^Cn < sup^g-p^ AH{tt) and hence, a similar inequality holds 
for C . 

To prove the > part, we use the random coding modified for constrained 
inputs. Let vr be a distribution satisfying (^, and let P be a distribution on 
the set of M words, under which the words are independent and 

P{U = {ii, . . .,in)) =TTi^- TCi^. 

Let z/„ = P(- J2k=i fi}k) < E) and define the modified distribution 



P{u = (ii, . . . 



0, otherwise. 



Let us remark that if tt G Vi, then M/ < E (where M (M) is the expectation 
corresponding to P (P)) and hence by the central limit theorem 

lim Un > 1/2. 
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Therefore < 2™M^ for any nonnegative random variable ^ depending on 
m words. In particular, for the error probability (^) we gave in [13| the upper 
bound (17) depending on arbitrary two words, the expectation of which with 
respect to P can be made arbitrarily small provided M = e"^,n — > oo, 
with R < C. Thus MA also can be made arbitrarily small under the same 
circumstances. The proof in is for finite dimensional Hilbert space, but 
under the condition @ it can be modified for infinite dimensions. Since 
the distribution P is concentrated on words satisfying (|^), we can choose a 
code satisfying this constraint for which A can be made arbitrarily small. 
Proposition 1 then implies that 



C > sup AH{tt) 



which completes the proof. 



II. The case of continuous alphabet 

In this section we take as the input alphabet A arbitrary Borel subset 
in a finite-dimensional Euclidean space S. We assume that the channel is 
given by weakly continuous mapping x —>■ Sx from the input alphabet A to 
the set of density operators in Ti. (The weak continuity means continuity 
of all matrix elements < ip \ Sx(p>;4',(p G H). We assume that a continuous 
function / on £^ is fixed and consider the set Vi of probabihty measures vr on 
A satisfying 

/ f{x)7r{dx) < E. (11) 

J A 

Like in the classical case, we discretize the channel by taking apriori 
distributions with discrete supports 

7i{dx) ='^ni6ccXdx), (12) 

i 

where {xi} C A is arbitrary countable collection of points and 

Sx{B) -- 



1, if X G B, 
0, if x^B, 



and by taking discrete resolutions of identity {Xj}. For tt of the form (|l^) 
the constraint (pAj ) takes the form (Q). Then we define the capacity of the 
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channel x — Sx with the constraint (|TT]) by repeating the argument in Sec. 
1 with the only modifications that 



P(jK)=Tr5.,X, 



J' 



and additional supremum in is taken over all possible choices of the points 

Xi G A. 

For arbitrary vr G Pi consider the quantity 

AH{n) = H{S^) - [ H{S.x)Tx{dx), (13) 

J A 

where 

S^= j SMdx). (14) 



Because of the weak continuity of the function the integral is well defined 
and represents a density operator in Ti.. Moreover, from the Lemma below 
it follows that the nonnegative function H{Sx) is lower semicontinuous, and 
hence the second term in (|13D is also well defined . 

Proposition 3. Under the condition in which S*^ is given by ( jl^ ) 
and Vi - by (0;, 

C = sup AH{tt). (15) 

Proof. The < part of the proof follows obviously from Proposition 2, as 
for TT given by ([121) the quantity (|13D turns into (H). To prove the > part it 
is sufficient to construct, for arbitrary vr G "Pi, a sequence of discrete vr^'^ G Vi 
such that 

lim inf AH{tt^^^) > AH{n). (16) 

>oo 

To this end for any Z = 1, ... we consider the division of A into disjoint 
subsets 

Bj!^ = {x:k/l<H{Sx)<{k + l)/l}, fc = ...,-1,0,1,... . (17) 

By making, if necessary, a finer subdivision, we can always assume that 
diameters of all sets Bj!^ are bounded from above by e^, where e/ — as 
/ — >■ cxD. Let x^[^ be a point at which f{x) achieves its minimum on the 
closure B^!^ of 5^, and define 

7r«(dx) = 5:7r(i?f)5.(o(t/x), (18) 
k * 
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where vr is a fixed distribution from Vi. Then 

f\x)Ti^^\dx) < f f{x)7r{dx), 



lience vr^'^ G Vi. 

By construction (|17D , ( p^Sf ) and due to tlie condition (||) we liave 



H{S^)7r^^\dx) 
and it remains to sliow tliat 



H{Sx)'K{dx) 



lim mf H (^J^S^TT^^\dx)j > H (^J^S^n{dx] 



(19) 



We first remark that due to the weak continuity and uniform boundedness 
of the function Sx, the density operators X4 SxT^'^^\dx) weakly converge to the 
density operator Sx7i{dx). Indeed, let Be be the ball of radius c in S. Then 



< 



<E 



SxT^^^\dx)ip > - < 01 / SxT^{dx)ip > 

J A 

< <j)\Smij >-< <f)\SxiJ > \7r{dx) + 2II0II ||^||7r(A \ B,). 



By choosing first c large enough to make the second term small, we can 
make the first term small for all large enough I since < (plSxip > is uniformly 
continuous on A fl -Be and the diameters of B^^ uniformly tend to zero. 

It remains to apply the following Lemma (this result is well known but 
we include sketch of its proof for completeness): 

Lemma. Let {Ai} be a sequence of density operators, weakly converging 
to a density operator A. Then 

lim inf H{Ai) > H{A). 
Proof. Let {Pm} be a monotonely increasing sequence of finite-dimensional 



projections weakly converging to unit operator in Ti.. By Lemma 4 from [|15 
the sequence 

HiPraSPj + TlP^,SPrJogTlP^SPm 

monotonely converges to -^(5') for any d. o. S. Then we have 
lim inf H{Ai) = lim inf lim H{P^AiP^) > lim lim H{PmAiPm) = H{A) 



Z— >oo 



This completes the proof of (|T^) and hence of Proposition 3. 
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III. The quantum Gaussian channel 

WITH constrained ENERGY OF THE SIGNAL 

Let A be the complex plane C, and let for every a G C the density 
operator Sa describe the thermal state of harmonic oscillator with the signal 
amplitude a and the mean number of the noise quanta N, i. e. 

^„ = -L / exp (-^^) \z X z\dh, (20) 

where \z > are the coherent state vectors. This is quantum analog of channel 
with additive Gaussian noise (see 0,0, [[131, [§[). We remind for future use 
that 

Sa = Via)SoViay, 

where 

V{a) = exp(aa^ — aa) 

are the unitary displacement operators, a\ a being the creation - annihilation 
operators for the harmonic oscillator, and the operator Sq has the spectral 
representation: 



Sn 



1 ^ / 



-V Inxnl, (21) 



where \n > are the eigenvectors of the number operator a^a. The states (^) 
all have the same entropy 

H{Sa) = {N + 1) \og{N + 1) - AT log AT, (22) 
and the mean number of quanta 

trSaa^a = N +\a\'^. (23) 
We impose the following constraint onto the mean energy of the signal 

J \a\'^rr{(fa) < E, (24) 

where 7r((i^a) is an apriori distribution. In fact, E is the "mean number of 
quanta" in the signal, which is proportional to energy for one mode. Consider 
the density operator 
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The constraint ( p^ by virtue of p3| ) implies 

Tr a^a<N + E. (25) 
It is well known that under this constraint the maximal entropy 

H{S^) = {N + E + l) \og{N + E + l)-{N + E) \og{N + E) (26) 
is attained by Gaussian density operator 

= / / exp ( --J^— ) \z >< z\(fz, (27) 



'n{N + E)J \ {N + E) 
corresponding to the optimal apriori distribution 

■n{d^a) = ^ exp ( ) d^a. (28) 

ttE \ E j 

Hence the condition of Proposition 3 is fulfilled, and the capacity of the 
channel is equal to 

C = H{S^) - H{S^) = log (l + 

+ (iV + E)log(l + ^)-iVlog(l + l 

This quantity was anticipated in (relation (4.28) ) as an upper bound 
for the information transmitted by the quantum Gaussian channel. On the 
other hand, for a long time this quantity was also known as the capacity of the 
"narrow band photon channel" 0, Our argument based on Proposition 
3 gives for the first time the proof of the asymptotic equivalence, in the sense 
of information capacity, of the Gaussian channel with the energy constraint 
( |24D to this quasiclassical channel. To make the point clear, we give below a 
simplified one-mode description of the photon channel. 

Consider the discrete family of states 

Sm = P{m)SoP{m)\ m = 0, 1, . . . , (29) 

where P{m) is energy shift operator satisfying P{m)\n >= |n + m > . Notice 
that P{m) = P™, where P is isometric operator adjoint to the quantum- 



mechanical "phase operator" [12 . The states Sm all have the same entropy 



(^4 ) as the states Sa, and the mean number of quanta 

tiS^rn a^a = N + m. (30) 
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Moreover, all states (^) are diagonal in the number representation, in which 
sense the channel may be called quasiclassical. 
Imposing the constraint 



^ rmrra< E, (31) 

m=0 

where vr^ is an apriori distribution, and introducing the density operator 

oo 



m=0 



by virtue of 



we obtain the same constraint ( pS]) for the new operator 

which 



S'^. The maximal entropy (pGj) is again attained by the operator 
has the spectral representation 



S-jr 



N + E + 



^ n=0 



N + E 
N + E + 1 



\n >< n\ 



(32) 



It corresponds to the optimal apriori distribution |T| 

1 



N + E '^''^ N + E 



N+E+1 \N+E+1 



N + E 



There is notable difference between the case of pure-state channel as 
opposed to the general case. For a pure-state case (where = 0), one can 
formulate a broader problem of finding a maximum capacity channel x ^ Sx 
with arbitrary alphabet {x} and an apriori distribution iT{dx) satisfying the 
output constraint 

TiS^ a^a < E. 

This was done in WE\ where it was shown that the noiseless photon channel 
provides a solution to this problem. In view of the result of 0, any other 
pure-state channel satisfying 



S^iridx) 



E + 



1^ 

^ n=0 



E 



E + 1 



\n >< n\ 



gives, asymptotically, a solution to the same problem. However, in the general 
case imposing the output constraint (B^) instead of the input constraints (E3) 
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or ( |3TD looks rather artificial; the equivalence of these constraints for appar- 
ently different channels seems to be a very special feature of the quantum 
Gaussian density operators. 



IV. The upper bounds for error probability 

A much more detailed information concerning the rate of convergence of 
the error probability can be obtained for pure-state channels, by modifying 
the estimates from to channels with infinite alphabets and constrained 
inputs following the method of , Ch. 7. We start with the case of discrete 
alphabet. 

Let Si = \ipi >< ipi\ be the pure signal states of the channel, and let vr 
be an apriori distribution satisfying the restriction (^. Then the following 
random coding bound holds for the error probability p{n, N) where N = e"^ 
with R<C: 



p5 



2 



p(n,e"")<2 exp{-n[fi{7T,s,p) - sR]}, (33) 



(34) 



where 

/i(7r,s,p) = -logTr|^7r,eP[^«-^l^,| , 
and 0<s<l,0<j9, 0<(5 are arbitrary parameters. The quantity 

n 

Un,s = PiEn -5<Y^ fitk) < nE) 

k=l 

satisfies lim„^oo \/ni'n,& > 0, thus adding only o(n) to the exponential in (|33|) . 

The bound (|33| ) is obtained in the same way as Proposition 1 in 0, that 
is by evaluating the expectation of the average error probability (^ using 
random, independently chosen codewords, but with the modified codeword 
distribution 



Ps{u = (ii, . . .,in)) 



J^ni^ii ■ • • • ■ 7ri„, ifnE-6 < Efc=i/(^fe) < nE, 
0, otherwise. 

(35) 
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The point is that for any random variable ^ depending on m words 
M,e< — Mexp{mpY.[fi^,)-E]}i, 



fe=l 



where p > 0. By using this inequahty after equation (14) in the proof of 
Proposition 1 from 0, and following argument in Ch. 7 of [Q, we can obtain 
the bound (|33D. 

In the same way, the proof of Proposition 2 from can be modified to 
obtain the expurgated bound 

2 2eP^ 

p(n,e"-f^) < exp{-n[jl{TT, s,p) - s{R + - log )]}, (36) 

where 

fi{7r,s,p) = -slog^Tr.TTfce^l^W+^C^^-^i^] | < ^.l^-fc > ^/^ (37) 

i,k 

These bounds can be extended to pure-state channels with continuous 
alphabets by using technique of Sec. II to obtain (^), (^) with 

/i(7r,s,p) = -logTr||^e^[^(^)-^l5,7r(da;)}'^', (38) 

fl{7r,S,p) = -slog / / QP[f{x)+f{y)-2E]^ ^ ^^1^^ > \^/'7T{dx)7T{dy). (39) 
J A J A 

Introducing the reliability function 

E{R) = lim sup — log ■ 



ji-»oo n p[n, e"-"j 

which characterizes the exponential rate of convergence of the error proba- 
bility, we get the lower bound for E{R): 

E{R) > max{Er{R),E,,{R)}, 

where 

Er{R) = max (maxmaxyu(7r, s,p) — sR), (40) 

0<s<l 0<p TrePi 

Eex{R) = max(maxmax/i(7r, s,p) — sR). (41) 

l<s 0<p ttGPi 

An example where the maximization at least partially can be performed 
analytically is considered in the following Section. 
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V. The reliability function 

OF QUANTUM GAUSSIAN PURE-STATE CHANNEL 

We are going to apply results of the previous Section to the Gaussian 
pure-state channel a ^ Sa = \a >< (y.\ with the constraint (plf). By taking 



the optimal apriori distribution ([28|) we can calculate explicitly the functions 



Namely, to calculate (^81), we remark that 

-pE 



M'-E)s^T,{d?z) = ^ I e-^|z X z\dh 

^ ' l-pE TiE' J ' ' 



-pE 00/ j^i \ 



\n >< n\ 



where E' = E/{1 — pE), provided p < E ^, and the trace of the (1 -|- s)-th 
power of this operator is easily calculated to yield 

fi{7i,s,p) = {1 + s)pE + \og[{l + E - pEY+' -E^+']. (42) 

By taking into account that 

I < z\w > r = e" 



|2 _ ^-\z-u,\^ 



see, e. g. 0), we can calculate the integral in (^) as 

exp{-[{E-^ + s'^ - p)\z\^ + {E-^ + s-^ - p)\w\^ - 2s'^Rezw]} 



^-2pE 



{nEy 



-2pE 



1 + p2^2 _ 2pE - 2pE^/s + 2E/s ' 

for p < E^^ , whence 

fi{7f,s,p) = s{2pE + \og[l+p'^E^ - 2pE + 2E{l-pE)/s]}. (43) 
Trying to maximize /i(7r, s,p) with respect to p we obtain the equation 
{l + E~pEy{l-p)=E', (44) 
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which can be solved exphcitly only for s = 0,1. Thus, contrary to the classical 
case 0] , the maximum in (^) in general can be found only numerically. For 
s = we have p = and 

C = ^/i(7r, 0, 0) = + 1) \og{E + 1)-E log E. 

OS 



For s = 1 equation (|4^) has the unique solution p(l, i?) = 1 + 1/ E—g{E)/ E < 
E^^, where 

i + vwn 
m = — - — . 

For future use we find the important quantities 

/i(7r, l,p{l, E)) = 2{E + 1- g{E)) + \ogg{E)- 

-/.(vr, l,p(l, E)) = E + 1- g{E) + ^ ^(r)2 _ jj;2 (45) 

The optimization of the expurgated bound can be performed analytically. 
Taking partial derivative with respect to p we obtain the equation 



the solution of which, satisfying p < E~^, is 

p{s, E) = + E'^ - E-^g{E/s). 

Substituting this in (^T]), we obtain the following expression, which is to be 
maximized with respect to s > 1: 

fi{7i, s,p{s,E)) - sR = 2{E + s - sg{E/s)) + s\ogg{E/s) - sR. 

Taking derivative with respect to s, we obtain the equation 

g{E/s) = e^, 

the solution of which is 

s = , ^ (46) 
Ve2« - e« ^ ' 



14 



If this is less than 1, which is equivalent to 

R<logg{E) = ^^fi{'K,l,p{l,E)), 

then the maximum is achieved for the value of s given by ( PB| ) and is equal 

to 



2E{1 - Vl - e-«) = E,,{R) > Er{R), 

(which up to a factor coincides with the expurgated bound for classical Gaus- 
sian channel). In the range 

where the optimizing s is equal to 1, we have the linear bound 

E,^{R) = EriR) = fii7T,l,p{l,E)) - R, 

with the quantities ^yu(vr, E)), fi{TT, E)) defined by (|45|) . Finally, 

in the range 

^/i(7r,l,p(l,E))<i?<C 

we have E^xiR) < Er{R) with Er{R) given implicitly by 

On the other hand, for the pure-state photon channel the analysis of 
the error probability is trivial: since this is quasiclassical noiseless channel, 
the error probability is zero for R < C. Thus, although the two channels 
are asymptotically equivalent in the sense of capacity, their finer asymptotic 
properties are apparently essentially different. 
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